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Field of thg Invgntion 

This invention relates generally to methods, algorithms, and computer 
software programs for optimizing the performance of advertisements and other 
messages or content, and more particularly to methods, algorithms, and computer 
software programs for optimizing the performance of advertisements, messages, and 
other content in an interactive measurable medium such as a global network of 
interconnected information devices and appliances, and most particularly to 
optimization of performance of banner ads presented on internet web sites. 

BACKGROUND 

This invention relates to method, procedure, algorithm, system, and 
computer program for improving and attempting to optimize the performance of 
marketing campaigns in which advertisements or other messages are distributed over 
an interactive measurable medium such as the Internet. When the message is an 
advertisement, the campaign involves a list of ad alternatives and a target customer 
population. The goal of the marketer is to allocate the ad alternatives to the customer 
population to optimize business objectives such as maximizing the number of 
responses received. When the message is other than an advertisement, the goal is to 
otherwise allocate messages to optimize analogous business or campaign objectives, 
typically measured by the number of successes or successfiil responses. In this 
description, the term "ad" has the same meaning and is used interchangeably with 
the term "advertisement". 

In large part due to the particular applicability of the invention to 
advertisements on the Intemet, this background description focuses on internet 
advertising to establish one context of the invention and to differentiate the 
invention from conventional systems and methods. It is to be understood however, 
that the invention is not to be interpreted to be limited only to an Intemet advertising 
environment. 
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Various systems currently exist for the delivery and tracking of 
advertisements on the internet, for instance, ad servers for serving and tracking 
"banner ads" on a web page. The users of these ad delivery or ad server systems 
have access to data on the performance of all the ads on all the locations. This data is 
5 updated by the delivery and tracking system on a periodic basis. The users are also 
provided with an array of parameters to configure the delivery and tracking system. 
In a typical conventional situation, an advertiser buys advertising space (ad space) 
on a number of web sites. The advertising space buy on each web site consists of a 
number of categories. Such categories may correspond to different sections within 
10 that web site, where a section is a specific web page or a set of related web pages 
2 within the site. A category may also correspond to keywords searched by a customer 

^ on a search engine. The term "zone" will be used to represent a unique site and 

Qi category combination. There may typically be a number of banners that an advertiser 

i ; I 

5 wishes to deploy across these zones. A banner is either a graphic image that 

15 announces the name or identity of a site or is an advertising image. An impression 
O occurs when an Internet visitor sees a banner. A clickthrough occurs when a visitor 

p=J to a zone clicks on a banner. This redirects the visitor to the page on the advertiser's 

website. 



20 The term "placement" is used to refer to a particular banner-zone 

combination. The fraction of impressions (relative to the total number of 
impressions associated with the particular zone) that should be allocated to a 
particular banner alternative is an important placement parameter that the advertiser 
can select and modify, to boost the advertising campaign performance. 

25 

Impressions can occur at any time — ^whenever someone visits the appropriate 
page of a web site. However, the reports are typically updated at discrete times. We 
will call the intermediate time between two reports a stage. At the end of each stage, 
the results are available for that stage. In particular, the following information is 
30 available for each placement: (1) the number of impressions delivered during a 
stage, and (2) the number of clickthroughs generated during a stage. 
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Additionally this information (that is, the number of impressions delivered 

Hiirincr o ctarrf* ar»r1 mimK#ar nf r»lir»lH-V»rQiirrVic fyf^np^r^feH durinU E StaHS^ maV bs 

available separately for: (a) different time slots within a stage (e.g. hour of day, if 
each stage is a day); (b) different frequency levels i.e. the number of times that an 
5 ad was seen by the customer; (c) different operating systems used by the machine on 
which the customer is logged on; (d) different internet browsers used by the 
customer; and(e) different domain addresses of the computer on which the customer 
is logged on. This list is exemplary and not intended to be exhaustive. 

10 In conventional systems and methods, these reports are provided in printed 

form or in the electronic equivalent of printed form, and are manually analyzed by 
trained analysis personnel to derive new, improved advertisement configurations. 
For example, they are analyzed in an attempt to optimize the clickthroughs 
generated by a pool of banner alternatives for a given zone, a given frequency level, 

15 and the like configuration information. This manual process is tedious and error- 
prone and has an inherent delay between the period of data collection and the time 
new advertisements are to be placed because of the large amount of data to be 
analyzed and the large number of parameters to be modified. Even if errors are not 
made and the user is able to overcome the tedium of the process, it is unlikely to 

20 yield optimal or even near-optimal recommendations for advertisement 
configurations. This is especially true in light of the typical delay of from a day to a 
week that elapses between data collection, analysis, and a new or modified ad 
campaign based on the analysis. 

25 Optimization to provide an^ effective advertising ^ampaign is in essence a 

multi-dimensional optimization problem, that by-and-large cannot be timely solved 
using conventional tools, methods, or systems. It is noted that these problems exist 
substantially independent of the type of advertisement or message, and that such 
issues and problems exist relative to advertisements for products and services, 

30 political campaigns, ballot measures and initiatives, media programming, lobbying. 
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surveys, polling, news headlines, sports scores, as well as other directed marketing, 
promotions, surveys, news, information, other content generally, and the like. 

Therefore, there remains a need for an automated system for optimizing 
5 allocation parameters for advertisement altematives or message alternatives. There 
also remains a need for an automated system and method for rapidly and efficiently 
executing the optimized allocation parameters to place the advertisement or message 
on the Internet or other local or global communication system. More particularly 
there remains a need for an optimization procedure or algorithm that utilizes 
10 available message performance information (for example, ad performance 
information) and generates recommendations for maintaining good performance or 
for improving performance during a subsequent stage of the campaign or optimizing 
performance of the entire campaign. 

15 There also remains a need for a system and method that can leam and 

optimize across the various other parameters that can be reconfigured in 
advertisement delivery systems also commonly referred to as ad servers. For 
example, there remains a need for an ad server system and method that permits an 
advertiser to display different banners (or other content or messages) based on a 

20 time-of-day user web browsing profile which may include geographic location 
information, demographic information, or the like, as well as other user targeting 
information. 

There also remains a need for an operating model that provides the optimized 
25 allocations for banner ad altematives or message altematives automatically on an 
interconnected network of computers or other information devices or appliances 
without significant human intervention. 

These and other needs in conventional systems and methods are solved by 
30 the inventive system and method, particularly by the inventive optimization method 
and algorithm and computer software implementations of the inventive optimization 
algorithm and method. 
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FIG. 1 is an illustration showing a system according to an embodiment of the 
invention with which the inventive method and computer program may be used. 

FIG. 2 is a diagrammatic illustration showing a flow chart of an embodiment 
of the inventive optimization procedure. 

SUMMARY 

The invention provides method, procedxxre, algorithm, system, and computer 
program for improving and attempting to optimize the performance of messaging 
campaigns, particularly to marketing campaigns in which advertisements or other 
messages are distributed over an interactive measurable medium such as the 
Intemet. When tl^emessage is an advertisenient^the campaign involves a list of ad 
altematives and a target customer population. The goal of the message manager or 
marketing manager is to allocate the ad altematives to the customer population to 
optimize business objectives such as maximizing the number of positive responses 
received. When the message is other than an advertisement, the goal is to otherwise 
allocate messages to optimize analogous business or campaign objectives, typically 
measured by the number of successes or successful responses. 

An optimization procedure is implemented on a computer that is 
programmed to retrieve message performance information and to generate 
recommended message allocations for a next stage in a multistage messaging 
campaign to achieve messaging goals. In one aspect the invention provides a 
method for improving the stage-to-stage performance of a message in a multi-stage 
message campaign in an interactive measurable medium. The interactive 
measurable medium may for example, be the Intemet or other distributed set of 
interconnected computer or information appliances. The measurable quality of this 
medium lies in part on its ability to measure responses to messages of different 
types. The method includes reading the prior stage message state pertaining to a 
prior stage in a message campaign, where the prior stage message state includes a 
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cumulative number of trials and a cumulative number of successes for a particular 
message at the end of the prior stage. The message performance results representing 
message trials and message successes from the previous stage based on the prior 
stage state are then read; and a current (or new) message state is computed on the 
5 basis of the prior stage message state and the message performance results. Finally, • 
a current (or new or updated) message allocation is generated based on the current 
message state. Normally, the newly determined current message state is stored as 
prior stage state for a next iteration of the procedvire. Desirably, but optionally, the 
cumulative number of trials and the cumulative number of successes include 

10 discounted cumulative number of trials and discounted cumulative number of 
successes; and generating a current message allocation includes applying a message 
allocation constraint. This inventive procedure may be applied to various messages 
including for example, web site banner advertisements, electronic advertisements 
generally, email messages and promotions, and the like. In one embodiment, the 

15 messages are intemet web site banner ads, trials are impressions of the ads presented 
to the user, and successes are measured by clickthroughs from the banner ad to the 
web site associated with the banner ad. 

Computer system, computer program, and computer program product 
20 associated with the inventive method and procedures are also provided by the 
invention. 

DETADLED DESCRIPTION OF EMBODIMENTS OF THE INVENTION 

The invention provides system, apparatus, method, computer program, and 
25 computer program product for monitoring and improving the performance of 
messages communicated over an interactive measurable medium, such as the 
Intemet. The term optimizing is used to describe the attempt to improve 
performance though those workers having ordinary skill in the art will appreciate 
that while there may be only a single "optimum" which may not always be attained, 
30 there are many degrees of performance improvement that may be obtained. As used 
in this description, optimization conveniently means improvement rather than 
requiring attainment of any single optimum value. Put differently, optimization 
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refers to procedures, algorithms, and other attempts to attain optimum performance 



Sectional headers provided in this description are provided merely for the 
5 purpose of assisting the reader in understanding where a primary description of a 
particular aspect of the invention is described. It is understood that aspects of the 
invention are described throughout the description, drawings, and claims and that the 
headers do not limit the description in any way. 

10 The messages that are being optimized may be any type of message or 

communication and include by way of example but not limitation messages such as 
advertisements including web site banner ads, marketing campaigns, political 
campaigns, surveys, sales and promotions of various types, public service 
announcements, electronic mail or email, pop-up windows and the like. Usually, 

15 there will be several message alternatives. 



In simplified terms, one embodiment of the invention provides an 
optimization system and procedure that selects (or selects more fi^equently) and 
utilizes one or more well performing message alternatives and deselects (or selects 

20 less fi"equently) poorer performing message alternatives from among the available 
message alternatives based on the past performance of at least some of the other 
message alternatives. However, recognizing that a particular message alternative's 
performance may change (improve or degrade) over time, in at least some 
embodiments of the invention, even poorer performing message altematives may be 

25 allocated some share of the total number of messages to be sent. The inventive 
system and method are adaptive and leam continuously. In some embodiments, 
newer performance information is weighted more heavily than older performance 
information, and the manner in which this weighting function is applied is selectable 
and programmable as are many other aspects of the inventive procedure. 



30 
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Among the many features that distinguish the inventive system and method 
from conventional ones, the invention provides a procedure that recommends and 
allocates messages to and among a plurality of different message alternatives at 
intervals of time. These intervals of time are referred to here as "stages" and though 
5 they may be any arbitrary intervals of time, the stages are desirably regular intervals 
of time. For example stages may be 6-hour intervals, 12-hour time intervals, one per 
day, one per week, or any other interval. Usually, the intervals should be long 
enough to allow collection of significant performance data, and not so long that 
important short-term trends may be missed. In developing this recommendation and 
10 allocation, one embodiment of the inventive procedure uses the available 
performance or response data from all prior stages in the message campaign. In 
another embodiment, the performance or response data from prior stages may be 
m discarded after a predetermined number of stages have passed or the performance or 

7] response data may be weighted to increase the contribution for recent data and 

S 15 discoimt the contribution for older data. In either case, the inventive procedure tries 

^ ' to maximize performance, which may be indicated by the total number of positive 

responses generated by the message set. 

y ^ 

To achieve these benefits, the inventive procedure spreads messages to all or 
S 20 most of the message alternatives, in the earlier stages, to discover high performing 
alternatives. As more information is available about the performance of these 
message alternatives, a higher concentration of messages are sent to better 
performing message alternatives, where better performing describes relative 
performance between the message alternatives sent. The inventive procedure can 

25 also robustly handle reasonable fluctuations in the underlying performance of a 
message alternative without deteriorating performance significantly. This later 
characteristic is beneficial because performance of a message altemative may 
change over time due to seasonal fluctuations or other reasons. Empirical and 
heuristic evidence, including computer simulation, have demonstrated the 

30 effectiveness of the inventive procedure in improving performance over that 
achieved by simplistic allocation strategies. Some of these simplistic allocation 
strategies include, for example distributing the messages uniformly over all the 
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different message alternatives or assigning most of the messages to the best 



5 types, the description below for purposes of illustration focuses on advertising 
messages, and web site banner ads in particular. The general principles, algorithms 
and procedures, and parameters described relative to banner ads (or banner ads) 
apply as well to messages of other types in interactive measurable digital 
environments. 



A marketing or ad campaign involves a list of ad alternatives (a particular 
type of message altemative) and a target customer population. The goal of the 
marketer is to allocate the ad altematives to the customer population to optimize 
business objectives such as maximizing the number of responses received. In an 
15 embodiment of the invention directed to advertising, the invention provides 
procedure and algorithm for optimizing the performance of marketing campaigns in 
which advertisements are distributed over an interactive, measurable medium such 
as the Intemet. One particular type of ad that is a very popular type of 
advertisement is the intemet web site banner ad. 



A banner ad, or more simply "banner," is an on-line ad the advertiser wants 
displayed at one or more zones. An advertiser buys ad space (advertising space) on 
at least one, but usually on a nimiber of web sites for the banner. The ad buy on 
each site consists of a number of zones, where the zones may for example be 
25 different sections or locations within that site, or more generally locations anywhere 
on the entire network. We will use the term "zone" to specify a location at which a 
banner ad or other message can be displayed on a given page of a given Intemet site. 
During a banner ad campaign, the advertiser pays to have banners shown at one or a 
collection of zones. 




While the inventive system and method may be applied to various message 



10 
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There are likely to be a number of banner ad alternatives that the advertiser 
wishes to deploy across these zones. For a given campaign, a marketer will provide 
a collection of banners that need to be assigned to the available impressions. The 
term "impression" is commonly used in the industry to refer to the occurrence of a 
5 banner ad on a Internet web site. A "click-through" occurs when a visitor to a zone 
clicks on a banner. Clicking-through a banner ad redirects the visitor to the page on 
the advertisers web site. Impressions occur when a user visits the appropriate page 
of a site where the banner is displayed. The term "message alternative" is used more 
generically for other message types, where a particular " message" is selected from 

10 available message alternatives. In an analogous manner, an "email" is selected from 
available "email alternatives," a "pop-up window altemative" is selected from 
available "pop-up window alternatives," a "web page" is selected from available 
"web page alternatives," and a "web layout" is selected from available "web layout 
alternatives." Using this syntax, a "banner" is selected from "banner alternatives." 

15 The phrases "message impression," "email impression", and "web page impression" 
though being analogous to "banner impression" have not been adopted in the field, 
rather the terms "email", "message", and "web page" themselves correspond to the 
impression. 

20 Reports providing information as to the performance of such banner ads are 

generated or updated at discrete times. The intermediate time between two reports is 
referred to as a stage. At the end of each stage, the ad performance results are 
available for that stage. In particular, ad performance indicators such as the number 
of impressions delivered and the number of clickthroughs are available for each 

25 banner at each zone. 

The invention provides an automated system for optimizing ad allocation 
parameters and executing them on the Internet or other communication system or 
media. The invention can also be used to optimize across the various other 
30 parameters that can be selected and configured in ad delivery systems. For example, 
ad delivery systems may typically allow an advertiser to display different banners 
based on the time of the day, the profile of the user browsing the web site (for 
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example, their geographic location and other demographic information), as well as 



MVft vrx AAXfJkXXV^LXXX.^ tUX^WI.XXX^ WXXLWXXCX* 



A marketing manager or other person responsible for directing the ad 
campaign may typically impose certain constraints on where or when certain 
banners may be displayed. For example, a given banner might be excluded from 
appearing at a specific zone during certain stages of the ad campaign. This type of 
constraint may reflect the fact that an advertiser's buy for one zone covers different 
stages than the buy at another zone. Or, a particular banner may just not be 
appropriate for some zones. These are merely illustrative of the constraints that may 
be imposed and it will be clear to those workers having ordinary skill in the art that 
virtually any parameter or condition associated with placing ads may be constrained 
by the marketing manager or other responsible person. In the context of the 
inventive structure and method, the ability to constrain one or more ad campaign 
allocation parameters, conditions, or other attributes is desirable so that any 
constraints that might not normally be constrained by the optimization procedure are 
understood to be constrained and taken into account. 

One embodiment of the inventive optimization method may be utilized in 
conjunction with a system and method for optimizing the performance of messages, 
embodiments of that are described in United States Patent Application Serial No. 

/ , (Attorney Docket No. A-69257/RMA), filed _ May 2000 and entitled 

System, Method, and Operating Model for Optimizing The Performance of Messages 
In An Interactive Measurable Medium, which is hereby incorporated by reference. 
The inventive optimization method and algorithm may also be utilized in 
conjunction with a system and method for improving the performance of electronic 
mail and electronic mail systems, embodiments of which are described in United 
States Patent Application Serial No. / (Attorney Docket No. A- 

68760/RMA/LXM), filed May 2000 and entitled System and Method for 

Optimizing The Performance of Email and Other Message Campaigns^ which is 
hereby incorporated by reference. However, those workers having ordinary skill in 
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the art in light of the description provided here will appreciate that the invention is 
not limited to these particular exemplary applications, 

The invention can learn and optimize across the various other parameters that 
are available for configuring message or ad server systems. For instance, ad server 
systems permit display of different banner ads based on the time of the day the 
profile of the user browsing the web site (e.g. their geographic location and other 
demographic information), as well as other targeting criteria. The inventive 
optimizing system and method leam and then set the appropriate parameters across 
multiple content and placement dimensions so as to optimize the objective specified 
by the message provider, such as an advertiser. 

Those workers having ordinary skill in the art in light of the description 
provided here will appreciate that the system as described above can be easily 
extended to be useful for optimizing other message types, including, for example: 
(1) email alternatives, (2) pop-up window alternatives, (3) web page altematives, 
and (4) web layout altematives. 

In one aspect, the invention provides a system for optimizing ads or other 
messages in an interactive measurable medium and automating the configuration of 
an ad server system in a larger interactive medium, such as the internet. The system 
includes an optimizing engine typically within a server of conventional type having 
a processor, a memory coupled to the processor, and a local data storage, along with 
input/output and other network communication devices for commimicating data and 
commands to other computers or information appliances. The optimization engine 
that implements the inventive optimization procedure in software takes as input an 
ad performance report or other performance data and generates configuration 
parameters for the optimal placement and targeting of ads. In one embodiment, it 
loads the performance reports from a database and stores the parameter values into 
the database. The database stores user-configured information, optimization engine 
generated configuration and the performance reports from the ad server system. An 
advertising server interaction service is operative to load performance reports from 
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the ad server system and store them in the database as well as taking user or 
optimization engine generated settings and setting them into the ad ser^/er system. 
In the system implementation, the user interface allows the user to enter campaign 
data and specifications about the optimization. Additional features of such a system 

are described in United States Patent Application Serial No. / (Attomey 

Docket No. A-69257/RMA), filed May 2000 and entitled System, Method, and 

Operating Model for Optimizing The Performance of Messages In An Interactive 
Measurable Medium, 

As the inventive method is advantageously implemented as a computer 
program on a general purpose computer, such as for example a computer operating 
in an Internet or other network server environment, attention is now directed to an 
exemplary computer system which may be used in conjunction with the inventive 
method, procedure, computer program, and computer program product. 

With respect to FIG. 1, computer 102 includes a processor or CPU 104 
coupled to a fast random access memory 106 storing data 108 and procedures 110. 
Processor 104 is also conveniently coupled to one or more mass storage device 112 
such as one or more hard disk drives of which many types are known. The mass 
storage devices may typically store information in one or more databases. In the 
context of the present invention, such databases may store performance data, 
allocation data, and other data and/or procedures or algorithms that are not currently 
being processed by the processor 104 or memory 106. Input/output devices 114 
may also be conveniently coupled to the processor or other computer system 102 
components for inputting data to the computer, retrieving results from the computer, 
monitoring activity on the computer or on other network devices coupled to the 
computer, or other operations as are known in the art. A network interface 116, such 
as a network interface card (NIC), modem, or the like is also conveniently provided. 

Procedures 110 may for example, include an operating system 120, an 
optimization procedure 122, a state vector update procedure 124, a message 
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allocation procedure 126, a pair- wise or other message alternative comparison 
procedure 128, as well as other procedures and algorithms 130. 

Data 108 may for example include one or more state vector 142, results 
5 vector 144, allocations vector 146, constraints list or constraints items 148, stage 
discount factors or parameters 150, threshold values 152, proportional of 
impressions or trials factors 154, or other data or parameters 156. 

Those workers having ordinary skill in the art in light of the description 
10 provided here will appreciate that the computer system 102 described here is merely 
exemplary and that various modifications may be made, or that other computer 
system architectures may be used in conjunction with the inventive system, method, 
and computer program. 

15 Optimization Procedure and Algorithm 

Having described certain high level features of the inventive optimization 
procedure as well as providing an exemplary environment in which the inventive 
procedure has particular applicability, attention is now directed to details of a 
particular embodiment of the inventive algorithm or procedure. 

20 

This embodiment is described in the internet banner ad context. In this 
application context, three principal types of data are used by the inventive 
procedure: (i) State Vector, (ii) Results Vector, and (iii) Allocations Vector which 
includes a Constraints List. (In one embodiment, the Constraints List is separate 
25 from the Allocation Vector, but used to generate the Allocation Vector.) These data 
types are described in turn. The term vector is used to refer to one form in which the 
data is stored and/or manipulated. 

Zone/banner States and State Vector. For each combination of a zone and a 
30 banner, the inventive procedure retains a "state vector" that records all of the 
information that has been collected on the given banner at the given zone 
(zone/banner combination). This state vector is updated at the end of each stage of 
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the campaign to reflect the results of that stage. In particular, the state vector 



Results Vector. The results vector is a vector of the results of the most 
5 recent stage of the campaign. It contains the unprocessed impression and 
clickthrough data for each zone/banner combination. If a banner was not displayed 
at some zone during the most recent stage, the results vector indicates this fact. 

Allocations Vector. The allocations vector contains the recommended 
10 allocation of banners to impressions for the next stage of the campaign. Intuitively, 
these allocations are expressed in percentage terms. However, in practice the 
recommended allocations may be re-expressed in a form acceptable to the ad server. 
Hence, some translation or transformation may be provided. The Allocations Vector 
may also include a list of constraints, though the constraints list may be separately 
15 provided. 

A list of any constraints on the allocation of banners to impressions is stored 
in the constraints list. At least two types of constraints can occur. First, the user can 
specify the minimum number of banners that must be retained and displayed at each 
20 stage of the ad campaign. Second, the inventive procedure implemented as a 
computer program on a general-purpose computer retains a table of "banner 
exclusions." The table of exclusions, includes for example, a table defined as a data 
structure in memory that indicates when a given banner should not be shown on a 
given zone. This table may include one or more entries or may be empty. 

25 

The inventive procedure is advantageously run at the end of each stage of an 
ad campaign. It allocates the available banner impressions or banner ad alternatives 
at each zone for the next stage among the available baimers. That is, it generates a 
strategy or reconrunendation for displaying banners during the next stage of the 
30 campaign. In developing this recommendation, the algorithm uses the results from 
all prior stages. (Though in at least one embodiment, a selected range of data from 
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older stages may optionally be discarded, and in a different embodiment, not all data 
from all stages has the same weight or value in the recommendation.) One objective 
is to allocate the banners at all stages so that the total number of positive results, 
such as user clickthroughs on a web site, recorded at all zones by all banners during 
the campaign is maximized. Other operations may optionally be performed, for 
example, during the ad campaign, the client advertiser or marketing manager may 
wish to delete or add a zone or banner. 

An embodiment of the optimization procedure 202 is now described relative 
to the flow-chart diagram of FIG. 2. First, the prior stage message state is read (Step 
204). Next, message performance results are read (Step 206). These performance 
data or results are typically received from another source, such as an ad service and 
stored (Step 208). The current (or new) message state is then computed (Step 210) 
using, for example, the state vector update procedure described in greater detail 
elsewhere herein. Message allocations, possibly with some format transformation to 
adapt them to the particular format needs of a receiver, are generated (Step 212) and 
sent to the receiver such as a message service (Step 214). As a primary application 
of the inventive method is for Internet advertising, the message service will 
frequently be an ad service. As the message campaign (ad campaign) is generally a 
multi-stage campaign, the procedure is repeated until all stages in the campaign have 
been completed. This requires that the prior message state used in step 204 be 
replaced with the current message stage (Step 216) generated in step 210 before the 
next iteration (Step 218). New message performance results received in step 206 are 
also used for the next iteration. 

While it is generally intended that performance data or results are received 
for each prior stage before allocations are generated for the current or next stage in 
the campaign, and the inventive procedure benefits from such timely receipt, the 
invention does not require such timely receipt. In the event that expected prior stage 
results are not for some reason received as expected, the inventive procedures are 
sufficiently robust that the new allocations are merely based on the previous 
performance data or results. Therefore, when new performance results are not 
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available, the old performance results are read or otherwise used in the 
computations. In some instances this may result in the same or substantially the 
same allocations as the previous stage, however, where discounting is applied, the 
allocations may not be the same. The extent of the difference may typically depend 
5 upon the discounting function and the historical prior performance results to which 
the discounting is applied. 

Each of these operations is now described in greater detail. For purposes of 
clarity of description, it is assimied that the procedure is applied relative to a single 
^ 10 zone. If multiple zones are available, the procedure is applied to each of the 

multiple zones one-at-a-time to derive the allocations for that zone. The banner ad 
performance may improve or deteriorate over time so that generally the performance 
P I of an ad is non-stationary in a statistical sense. There are a number of potential 

03 reasons for a banner to have non-stationary behavior in the underlying performance. 

15 For example, a banner that promises ovemight delivery may be quite effective 
2 shortly before Christmas, but much less effective on the day after Christmas. Even 

□ absent a particular identifiable event, the performance of ads may change over time. 

For example, Intemet visitors may see a given banner too many times and beyond 
O some point the banner loses all attraction. As the fraction of visitors who have seen 

20 the ad too many times increases the performance of the banner deteriorates. 

State Vector Update Procedure 

One embodiment of the inventive procedure optionally but advantageously 
uses a "discounting" of the performance data in computing or updating the State 

25 Vector. Such discounting may be in the form of a binary step function where data 
older than a certain stage is simply ignored, in the form of a linear or non-linear 
weighting function, or according to any other weighting or discovmting scheme. In 
one embodiment, the discovmt is in the form of a geometrical discounting, that is, at 
the end of each stage all data (such as for example, the number of impressions and 

30 nvmiber of clickthroughs) is multiplied by a one-stage discount factor beta, B, where 
beta is a number less than one (B <1). Thus, data that is n stages old at the time 
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when the procedure is executed will end up being multiplied by beta raised to the n 
power (B In this manner newer performance data is weighted more highly than 
older performance data. While not true in all circumstances, it is generally true that 
newer performance data provides better guidance as to what allocation scheme will 
5 be successful in the next stage than will older performance data. In any particular 
situation where this generalization does not hold true, different discounting schemes, 
including for example a discounting scheme that weights some segment of 
performance data more heavily than newer performance data, may be applied. If no 
discounting is desired, beta may be set to 1 (B=l). (Empirical and simulation studies 
10 have shown that a value of the one-stage discount factor (beta) of about 0.9 works 
well for a range of synthetic and real data sets, though values between about 0 and 
about 1 may be used, more typically between about 0.5 and 0.99, more usually 
between about 0.8 and about 1.0, and even more usually between about 0.85 and 
0.95 may be used.) 

15 

Let Si(t) denote the total number of discounted cumulative clicks and ni(t) 
denote the discounted cumulative number of impressions for banner i at the end of 
stage t. Let impi(t) denote the total number of impressions and clicki(t) represent 
the total number of clicks and impressions for a particular banner i in stage t. Then 
20 the discounted cumulative clicks Si(t) and the discoxmted cumulative impressions 
ni(t) are given by the following expressions: 

Si(t) = B Si(t-l) + clicki(t-l), for t ^ 2 ; and 

25 ni(t) = B ni(t-l) + impi(t-l), for t ^ 2; 

where Si(l) = 0 and ni(l) = 0. Thus, Si(t) and ni(t) represent two components of the 
State Vector. Discounting reduces the impact of old data and allows the inventive 
algorithm to be more responsive to new data that may reflect changes in the 
30 clickthroughs rates or other performance indicators. 
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The above expressions are specialized for an embodiment in which the 
performance metrics are based on numbers of "impressions" and "clickthroughs," 
where for a particular impression the viewer has the binary choice of either clicking- 
though or not clicking through for each impression. 

5 

In the set of equations above, the discounting is provided by the factor B. 
The above expressions can be easily generalized to an arbitrary or generalized 
discounting function G{...} which uses the clickthrough impressions from the 
previous stages to derive the discounted cumulative clicks Si(t) and the discounted 
10 cumulative impressions ni(t) using the following expressions: 

Si(t) - G{ clicki(l), clicki(t-2), clicki(t-l)}, for t ^ 2 ; and 

ni(t) = G{ impi(l), impi(t-2), impi(t-l)}, for t ^ 2 ; 

15 

where Si(l) = 0 and ni(l) = 0. In this generalized discounting function G {...}, it is 
noted that none, some, or all, of the clicki(t) and/or impi(t) may actually weighted or 
discounted, though in the preferred embodiment each is discounted. 

20 In particular functions which give more weight to more recent values of Si 

and ni are of interest. When a different binary response performance indicator than 
clickthroughs such as positive responses or "successes" are used, and a more general 
term applied to impressions such as "trials" is used, along with a generalized 
discounting function H {,..}, the general expressions for the total number of 

25 discoxmted successes Si(t) and the cumulative number of impressions ni(t) are given 
by the following expressions: 

Si(t) == H{ successi(l), successi(t-2), successi(t-l)}, for t ^ 2 ; and 



30 



ni(t) = H{ triali(l), triali(t-l), triali(t-l)}, fort ^ 2. 
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where Si(l) = 0 and ni(l) = 0. 

Thus, Si(t) and ni(t) represent the State Vector for banner As noted above for 
G {...}, none, some, or all, of the successi(t) and/or triali(t) may actually weighted or 
5 discounted by the discounting function H {...}, though in the preferred embodiment 
each is discounted. 

In some cases, before the inventive system or method are used for 
optimization, a campaign may already be executing. In this case, information may be 
10 available regarding the performance of the banners. This performance information 
can be used to initialize Si(l) and ni(l) as follows. A cumulative or discoimting 
5 procedure as described in the previous section can be use to estimate Si(l) and ni(l) 

in the same fashion as Si(t) and ni(t) are calculated, for example, by either using the 
P cumulative sum or some form of discounting. Those workers having ordinary skill 

-^j 15 in the art in light of the description provided here will appreciate that there are 

several similar variations of the above basic strategy. 

S' 

m 

Q Even when the campaign is started fresh, a campaign manager may have 

used the same banner in an earlier campaign or may have a prior judgement about 

O 20 the performance of the banner. In this case the value of Si(l) and ni(l) for banner i 

can be set as follows. The manager can provide an estimate of the average 
performance of meani for the banner i over the previous campaigns. He/she may 
decide that the information from the previous campaign is worth Ni impressions. In 
this case Si(l) can be set to (meani x NO and ni(l) can be set to Nj. 

25 

The setting of Si(l) and ni(l) as described above can be used to incorporate 
managers prior experience or judgement of the banners. Larger values of Ni imply 
that the manager is more confident about extrapolating the future performance of the 
banner based on his prior experience or judgement. 
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Those workers having ordinary skill in the art in light of the description 
provided here will appreciate that there arc several similar variations of the ahove 
basic strategy 

AlloQ^tiQH CQmputatiQn PrQpgijyre 

Having now described several embodiments of the state vector update 
procedure, attention is now directed to the allocation computation procedure. At the 
initial stage (when no information has been collected on the banner ads), the 
inventive procedure may allocate the impressions in any one of several possible 
manners. Such initial allocation may be based on objective or subjective criteria. In 
one embodiment, an even or uniform initial allocation of impressions are made from 
among the available banners. That is, if there are a number b banners, then each 
banner will receive 1/b of the total number of allocated impressions. 

At each of the remaining stages, the inventive procedure uses the available 
performance data (for example, the record of impressions and clickthroughs from 
the earlier stages), allocation information, and other information that may be 
available on the banners to divide the available banners into two classes: (i) a first 
group referred to as the contenders, and (ii) a second group referred to as the non- 
contenders. The "contenders" are the banners that are still in contention according 
to some criteria to be declared the best banner. The "non-contenders" are the 
banners that appear out of the running for best banner based on the available 
information. There may also be a (iii) third group of "unavailable banners" if the 
constraint list indicates that some banners cannot be used on the given zone during a 
particular stage. Banners that cannot be used for the next stage during which the 
new allocations are to be made would be placed in the third group, as they cannot be 
displayed during the next stage independent of how well they might perform. 
Hence, those ads that are available for the next stage are divided into either the 
contender or the non-contender group. Any one of these groups may have none, 
one, or a plurality of ads. 
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Given this division of the banners into contenders and non-contenders, one 
embodiment of the inventive procedure allocates the impressions among the banners 
as follows. Let y<1 be a constant that represents the proportion of impressions that 
will be allocated to the non-contenders at each stage. This y is a constant selected at 
5 the beginning of the campaign. The value selected is not critical as the inventive 
procedure will reallocate between contenders and non-contenders at subsequent 
stages. The value may also be selected so that impressions are not allocated to non- 
contenders. Let b represent the total number of banners, bi denote the number of 
contenders and b2=b-bi the number of non-contenders. Then each contender 
10 receives the fraction (l-Y)/bi of the available impressions, while each non-contender 
receives the fraction yfbi of the available impressions. Using this approach, most of 
the impressions are spread evenly among the contenders and a few impressions are 
used to continue monitoring the non-contenders. As the campaign progresses the 
ff; number of contenders will generally decrease over time since the inventive system 

m 15 and method will have more information upon which to judge which banners are 

^ " more successful and which banners are less successful. 

C 

□ In an allocation task where there are five possible banners {Xi, X2, X3, X4, 

iij X5}, three of which are contenders {Xi, X2, X3} and two of which are non- 

O 20 contenders {X4, X5}, the Allocation Vector may therefore take the form of (0.3, 0.3, 

0.3, 0.05, 0.05) where the contenders each have a 30% proportion and the non- 
contenders each have a 5% proportion of the total. In a situation where there are six 
possible banners but one in constrained by the constraint list for a particular stage, 
that banner would have a zero proportion, for example, allocation vector of the form 
25 (0.3, 0.3, 0.3, 0.05, 0.05, 0.0). 



U1 



Exemplary Pair- Wise Test P rocedure to Identify^ Contenders and Non-Contenders 

In one embodiment, the division of banner ads into contenders and non- 
contenders is based on the following pair-wise test procedure for comparing two 
30 banners. When more than two banners are involved in a campaign a pair-wise test is 
used for each possible pair or combination of banners. 
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For each pair-wise test, suppose that two banners are given and let pi (for 
i=l, 2) denote the unkno^vn clickthrough probability of the i*^ banner. Let nj denote 
the total number of impressions for the i**' banner, Si the total number of successes 
and fi the total number of failures. (So, Si + = Uj). Finally, define: 

Z = (niSi - niS2)/(ni + n2) , and 

V = (nmz (si + S2)(fi + f2)/(ni + n2)^ 

It can be shown that Z is approximately normally distributed with variance V and 
mean 0V, where: 

0 = log((pi(l-p2))/(p2(l-pi))). 

The pair-wise test attempts to determine or test whether the two unknown 
clickthrough rates for the two banners under consideration are equal. If they are 
equal, then 0= 0 and Z is normally distributed with mean 0 and variance V. The 
distance between the observed value of Z and 0 (the mean of its distribution) 
measured in standard deviations is given by Z divided by the square root of V. If 

this statistic (Z/vV) is greater (in absolute value) than some cutoff threshold value a, 

then the procedure concludes that the click-through (or other performance indicia) 
rates are significantly different from each other. In one embodiment, the user (such 
as for example the marketing manager) specifies the cutoff threshold value a prior to 
the campaign; however, the value may be modified during the campaign. In one 
embodiment, the value of the cutoff threshold value a is selected to be between 
about V2 and about 1, based on empirical simulation results, but other values may be 
used. In one embodiment a cutoff threshold value a of 0.5 is used. 

The pair wise test is applied to all pairs of banners. If a particular banner 
loses one of these comparisons (that is, if a banner is determined to have a 
significantly lower click-through rate than another banner), then that banner is 
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designated as a non-contender. This pair-wise procedure guarantees that at least one 
contender will remain. 



Exemplary Procedure For More Than Two Sets of Alternatives 
5 The algorithm described above divides the alternatives into two sets: 

contenders and non-contenders. The basic approach can be extended to decompose 
the alternatives into more than two sets. Consider the pair-wise test between 
altemative i and alternative j. Let Compare(ij) represent the Z(ij)/sqrt(V(ij)) 
statistic where: 

10 

Z(i j) = (UjSi - niSj)/(ni + nj) , and 

V(i j)= (mnj (si + Sj)(fi + fj)/(ni + nj)3 . 

15 The value of Compare(ij) is positive if the performance of altemative i is better than 
altemative j and is negative if the performance of banner j is better than banner 
altemative i. Define Perf(k) = maxi<i<n Compare(i,k). The algorithm deems an 
altemative k to be a non-contender if Perf(k) is above the cutoff threshold, otherwise 
k is a contender. 

20 

The above basic approach can be extended to decompose the alternatives into 
more than two sets. Under these circumstances, the altematives are divided into 
several sets based on the corresponding Perf values. The impressions are allocated 
to different altematives such that the altematives for sets corresponding to higher 
25 values of Perf get smaller numbers of impressions. However, the number of 
impressions allocated to all altematives within a set is equal. In the extreme case, 
the number of sets is equal to the number of altematives. In this case, each 
altemative gets a different fraction of the total number of impressions (or trials). 
This approach is formalized in the next section. 
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Rxemplarv Procedure for Dividing Alternatives into Multiple Sets 

iiiG v aiUC \^uiiipan^\^i ii> pwoinvw ix mv jjwi Aviiixa-iiv/v v/x ai tx^x ncii-i v i 

better than alternative j, and the value of Compare(i j) is negative if the performance 
of banner j is better than banner alternative i. Define a performance function 
5 (Perf(k)) to be the maximum of Compare(i,k) over the range of i from i=l, n; 
that is, Perf(k) = maxi<i<n Compare(i,k). The algorithm deems an alternative k to be 
a non-contender if Perf(k) is above the cutoff threshold a, otherwise k is deemed to 
be a contender. 

One embodiment uses the value of Perf(k) to segment the banners into two or 
more sets (r > 2 sets). In this context, larger values of the performance function 
Perf(k) as defined here will generally imply poor relative performance of banner 
alternative k. Let there be user defined thresholds ai, ai, ... cx(r.i) such that a first 
threshold Oo =0, and the remaining thresholds are each greater than oo, that is 0=ao 
< ai < a2 <...< a(r-i) < ar=oo. The algorithm assigns a banner k to set j, where 
l<j<r, if a(j.i) < Perf(k) < aj. 

The impressions are allocated to different alternatives such that the 
altematives for sets corresponding to higher values of Perf get smaller fi-action of 
impressions. However, the number of impressions allocated to all altematives within 
a set is equal. In the extreme case, the number of sets is equal to the number of 
altematives. In this case, each alternative gets a different flection of the total nvmiber 
of impressions (or more generally, total number of trials). 

25 Those workers having ordinary skill in the art in light of the description 

provided here will appreciate that there are several similar variations of the above 
basic strategy that can use the Compare function to segment the banner into multiple 
sets. 
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One embodiment of the allocation algorithm is as follows. Define constant 
propi, prop2, prop3, propr such that propi < propa < props ^ propr. Propj 
represents the proportion of total impressions assigned to all the banners belonging 
to set j, where l<j<r. Let the number of impressions belonging to set j be given by 
numj, where 1 < j < r. (Note that the Sum of numj = total number of banners). Then 
banner belonging to set i is assigned the fraction of total banners = Propi /{Zi<j<r 
(numj X propj)}. 

Unlike the allocation strategy for two sets, the above allocation does not 
guarantee a fixed fi-action of impressions to be allocated to relatively under- 
performing sets. However, it guarantees that banners belonging to relatively poor 
performing sets get smaller number of impressions as compared to relatively higher 
performing sets. 

Those workers having ordinary skill in the art in light of the description 
provided here will appreciate that there are several similar variations of the above 
basic strategy using the Z and V statistics for the pair-wise comparisons which can 
be used for decomposing the alternatives into two or more sets. 

Procedure For Adding or Deleting Zones or Banners 

As mentioned previously, additional operations may be required when zones 
and/or banners are added or deleted. If a zone is deleted, then the number of 
available impressions for this zone is set to 0, and no further banners are allocated at 
this zone (imless the zone is re-introduced later). However, the data collected during 
earlier stages at this zone can be retained. Recall that in some embodiments, the 
earlier collected performance data is aged or discounted so that more recent 
performance data is given greater weight in the computations. If a banner is deleted, 
then this information is added to the constraint list so that the banner is no longer 
placed or not placed for a particular zone. Note that a banner may be deleted at one 
or more zones but remain for placement at other zones. After zones or banners have 
been deleted, all inputs to the inventive allocation computation procedure (for the 
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remaining zones and banners) are available and allocation for the next stage is 
determined. 

If a zone is added, any constraints that will affect the new zone are specified. 
If a banner is added, the new state vector is initialized (for example, with an empty 
history) and any constraints on the banner are specified. In one embodiment, at the 
next stage, this added banner is treated as though it were as good as the best existing 
banner, and is allocated the same number of impressions as the best banner (for each 
zone). However, this can be easily generalized to allocating more (or less) 
impressions than the best banner. Following this initial stage of data collection for 
the new banner, the allocation algorithm is applied as usual. 

Additional and Alternative Embodiments 

It can therefore be seen from the above description that the inventive system 
and method provide particularly advantageous benefits in the banner ad messaging 
context. The methodology applied to banner ads is now extended to other 
messaging situations. 

Electronic Mail 

Email is an important medium for advertising and marketing. Many 
businesses now readily use email to acquire new customers, build brands, advertise 
and promote products, measure consumer satisfaction and manage customer 
relationships. A typical email campaign involves sending emails to each address on 
a list of recipients. The list may typically be bought or otherwise acquired from an 
outside firm or collected intemally over a period of time. The procedures and 
algorithms developed above can be easily extended for optimizing the responses 
generated by emails. A system and method for improving the performance of 

Emails is described in United States Patent Application Serial No. / 

(Attorney Docket No. A-68760/RMA/LXM), filed May 2000 and entitled System 

and Method for Optimizing The Performance of Email and Other Message 
Campaigns^ which is hereby incorporated by reference. 
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The inventive procedures and algorithms may, for example, be extended to 
the email environment by decomposing the email process into multiple stages and 
dividing the measurement process into multiple stages. In each stage a fraction of 
5 the total emails to be sent out to each alternative is given by the allocation algorithm 
described above. The major difference in an email application as compared to the 
afore described banner ad application is that there is a non-uniform delay between 
the time an email is sent out and the time a response is received. (For banner ads 
this delay is either non-existent for practical purposes or fairly uniform, for example 

10 the delay for a banner ad clickthrough may be between about 1 second and 60 
seconds, well within one stage.) However, one can measure the total number of 
responses received for a given email altemative by the end of a given stage and the 
total number of emails sent for that altemative as a way of measuring the 
performance of a given altemative. This information can then be used by the 

15 algorithm to determine and allocate the better alternatives and optimize the total 
number of responses over the entire campaign. The duration of the stage should 
advantageously be chosen to ensure that a reasonable fraction of overall responses 
are received by the end of each stage and the number of stages are large enough that 
the algorithm can leam the better performing altematives in the earlier stages and 

20 exploit this information in later stages. 

Each of the inventive methods, algorithms, and procedures may be 
implemented as a computer program, also referred to as software or firmware, 
programs or code, on a general purpose or specialized computer. Such computers 

25 routinely include a processor or CPU, a fast random access memory coupled to the 
processor in which is defined a data structure storing program procedures and data 
associated with executing the programs or procedures in the processor. Such 
procedures include the inventive state vector update, pair-wise comparison, and 
allocation procedures, among other procedures described herein. An operating 

30 system is also typically provided. When implemented as a computer program or 
software or firmware, the invention also includes the program, software, and/or 
firmware in addition to a computer program product. Such computer program 
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product may tangibly embody the computer program on a machine readable medium 
such as a magnetic disc, CD-ROM, DVD, memory card, or any other media, or be 
stored and conmiunicated electronically. For example, the inventive computer 
program may be communicated electronically or optically over a communication 
5 link, such as for example over the Internet. 

The foregoing description, for purposes of explanation, used specific 
nomenclature to provide a thorough understanding of the invention. However, it will 
be apparent to one skilled in the art in light of the description provided that the 
10 specific details are not required in order to practice the invention. Thus, the 
foregoing descriptions of specific embodiments of the present invention are 
presented for purposes of illustration and description. They are not intended to be 
exhaustive or to limit the invention to the precise forms disclosed. Obviously many 
modifications and variations are possible in view of the above teachings. 

15 

The embodiments were chosen and described in order to best explain the 
principles of the invention and its practical applications, to thereby enable others 
skilled in the art to best utilize the invention and various embodiments with various 
modifications as are suited to the particular use contemplated. It is intended that the 
20 scope of the invention be defined by the following claims and their equivalents. All 
patents, publication, or other references referred to herein are hereby incorporated by 
reference. 



